Data compression and genomes: a two-dimensional life domain map.
Authors
Abstract
We define the complexity of a DNA sequence as its information content per nucleotide, estimated with a Lempel-Ziv data compression algorithm. The statistics of the complexity values of the functional regions of different complete genomes can be used to distinguish among genomes of the three domains of life (Archaea, Bacteria and Eukarya). We focus on the distribution function of the complexity of non-coding regions. We show that the three domains can be plotted in separate regions of the two-dimensional space whose axes are the skewness coefficient and the kurtosis coefficient of that distribution. Preliminary results on 15 genomes are presented.
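The two quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which Lempel-Ziv variant was used, so the sketch assumes Python's `zlib` (DEFLATE, which combines LZ77 with Huffman coding) as a stand-in, and it assumes the kurtosis coefficient is reported as excess kurtosis. The function names `complexity_bits_per_nt` and `skewness_and_kurtosis` are hypothetical.

```python
import zlib


def complexity_bits_per_nt(seq: str) -> float:
    """Estimate information content per nucleotide as compressed bits / length.

    Assumption: zlib (DEFLATE = LZ77 + Huffman) stands in for the unspecified
    Lempel-Ziv algorithm. Short sequences are dominated by header overhead.
    """
    data = seq.upper().encode("ascii")
    if not data:
        raise ValueError("sequence must not be empty")
    compressed = zlib.compress(data, 9)
    return 8 * len(compressed) / len(data)


def skewness_and_kurtosis(values):
    """Return (skewness, excess kurtosis) from population central moments."""
    n = len(values)
    if n == 0:
        raise ValueError("need at least one value")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    if m2 == 0:
        raise ValueError("all values are identical; moments are undefined")
    skewness = m3 / m2 ** 1.5
    # Assumption: "kurtosis coefficient" is taken as excess kurtosis (normal = 0).
    excess_kurtosis = m4 / m2 ** 2 - 3
    return skewness, excess_kurtosis
```

Under these assumptions, each genome would contribute one point: compute `complexity_bits_per_nt` for every non-coding region, then pass the resulting list to `skewness_and_kurtosis` to obtain the genome's coordinates on the two-dimensional map.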
Journal:
- Journal of Theoretical Biology
Volume 253, Issue 2
Pages: -
Publication date: 2008